Use is provided for a 7α-hydroxy or 7-oxo substituted 3β-hydroxy-steroid possessing the carbon skeleton of cholesterol, androsterone,

USE OF 7 ALPHA-SUBSTITUTED STEROIDS TO TREAT NEUROPSYCHIATRIC, IMMUNE OR ENDOCRINE
DISORDERS

The present invention relates to novel uses for 7α-hydroxy-substituted steroids, to
a process for preparing such steroids and to novel steroids so produced.

In particular the invention relates to the use of cytochromes of the cytochrome
P450 family designated Cyp7b to effect 7α-hydroxylation of certain 3β-OH steroids so as
to produce 7α-hydroxy-substituted steroids. Certain of the 7α-hydroxy-substituted
steroids so produced, as well as the corresponding 7-oxo derivatives, are novel and form
further aspects of the invention. The invention also relates to uses of these steroids, to uses
of Cyp7b enzymes and to uses of novel macromolecular species, e.g. antibodies and DNAs,
which are biologically related to the Cyp7b enzymes.

Cytochromes P450 are a diverse group of heme-containing mono-oxygenases
(termed CYPs; see Nelson et al., DNA Cell Biol. (1993) 12, 1-51) that catalyse a variety
of oxidative conversions, notably of steroids but also of fatty acids and xenobiotics. While
CYPs are most abundantly expressed in the testis, ovary, placenta, adrenal and liver, it is
becoming clear that the brain is a further site of CYP expression. Several CYP activities
or mRNAs have been reported in the nervous system but these are predominantly of types
metabolizing fatty acids and xenobiotics (subclasses CYP2C, 2D, 2E and 4). However,
primary rat brain-derived glial cells have the capacity to synthesize pregnenolone and
progesterone in vitro. Mellon and Deschepper, Brain Res. (1993), 629, 283-292 (9)
provided molecular evidence for the presence, in brain, of key steroidogenic enzymes
CYP11A1 (scc) and CYP11B1 (11β) but failed to detect CYP17 (c17) or CYP11B2 (AS).
Although CYP21A1 (c21) activity is reported to be present in brain, authentic CYP21A1
transcripts were not detected in this tissue.

Interest in steroid metabolism in brain has been fuelled by the finding that adrenal-
and brain-derived steroids (neurosteroids) can modulate cognitive function and synaptic
plasticity. For instance, pregnenolone and steroids derived from it are reported to have
memory enhancing effects in mice. However, the full spectrum of steroid metabolizing
CYPs in brain and the biological roles of their metabolites in vivo have not been established.

Many aspects of brain function are modulated by steroids. Intracellular receptors
for glucocorticoids (cortisol, corticosterone) are particularly abundantly expressed in the
hippocampus (1), a brain region that plays a key role in specific aspects of memory
formation, and which is an early and prominent target for dysfunction and damage in
Alzheimer's disease (AD). While glucocorticoids regulate learning and memory, mood and
neuroendocrine control, chronic glucocorticoid excess compromises neuronal activity,
synaptic plasticity and eventually survival, particularly in the hippocampus. These findings
prompted the suggestion that glucocorticoid-mediated neurotoxicity might underpin some
age-related brain disorders, including AD, in which plasma cortisol levels are markedly
elevated (2).

Conversely, dehydroepiandrosterone (DHEA), the most abundant steroid product
of the human adrenal cortex, has been proposed to protect against disorders of the aging
brain (3). Plasma levels of DHEA often show a striking age-associated decline which
correlates with loss of cognitive function (4). In rodents, injection of DHEA or its sulfate
into limbic structures improves post-training memory and enhances synaptic plasticity (5).
DHEA and glucocorticoids thereby appear to exert inverse effects upon memory function
and synaptic plasticity, and DHEA has been advocated as an endogenous
'anti-glucocorticoid'. However, despite considerable circumstantial evidence to support this
contention, there is no evidence for a direct interaction between DHEA and glucocorticoid
signalling pathways in neurons.

Neurosteroidogenesis has been reported in isolated rat retina (8) and brain (9). In
addition to the production of pregnenolone and DHEA from cholesterol, a variety of novel
steroids are made in brain extracts or cultured brain cells, including
20α-dehydropregnenolone, 7α-hydroxy derivatives of pregnenolone and DHEA, progesterone,
and both 3α- and 3β-hydroxy-5α-pregnan-20-one (reviewed in Ref. 7). Androgens are also
modified, particularly through the action of aromatase and a 5α-reductase (reviewed in Ref.
10). However, the specific enzymes responsible for these and other transformations in the
central nervous system have not been well characterized.

As referred to above, several Cyps are present in the central nervous system (11-22).
Activities or mRNAs corresponding to key steroidogenic enzymes (23-25), in addition to
Cyp19 (aromatase), have been detected. Furthermore, mRNAs encoding the non-Cyp
hydroxysteroid dehydrogenases (HSD) 3α-HSD, 3β-HSD and 11β-HSD have been reported
in the central nervous system (25, 27-29).

To investigate regulation of brain function, studies reported in copending
International Patent Application No PCT/GB95/02465, published as WO 96/12810, and in
Stapleton et al. (J. Biol. Chem. 270, 29739, December 15, 1995), focused on the
hippocampus, a brain region important in learning and memory. A copy of the specification
of International Patent Application No PCT/GB95/02465 has been filed with the priority
documents filed in respect of this specification.

That copending application, PCT/GB95/02465, describes and claims novel
cytochrome P450 proteins designated Hct-1. These Hct-1 proteins have now been named
as Cyp7b by the Committee on Standardized Cytochrome P450 Nomenclature and the
name Cyp7b will be used in this application.

The Cyp7b enzyme shares 39% sequence identity with hepatic cholesterol
7α-hydroxylase (Cyp7a) and lesser but significant homology with other steroidogenic Cyps.
The postulated steroidogenic domain (30,31), found in many of these enzymes, is present
in both Cyp7a and Cyp7b. Cyp7b mRNA is predominantly expressed in rodent brain,
particularly in the hippocampus, unlike Cyp7a, which is liver-specific (31-33 and
EP0648840 A2).

The present inventors have now investigated the substrate specificity of Cyp7b and
found that Cyp7b catalyses the introduction of a hydroxyl group at the 7α position in
steroid substrates, particularly 3β-hydroxy steroids. Cytochromes Cyp7b are thus steroid
hydroxylase enzymes having 7α-specificity. The ability to produce 7α-hydroxylated
steroids is of major commercial importance, because such steroids are of particular use in
the manufacture of pharmaceuticals (either as drugs per se or as intermediates), and in the
manufacture of test kits and assays for pathological conditions associated with the presence
of abnormal levels of endogenous enzyme, substrate or product.

The abbreviation "DHEA" will be used herein to designate dehydroepiandrosterone;
thus 7α-hydroxy-DHEA designates 7α-hydroxydehydroepiandrosterone.

The present inventors have identified substrate/product pairs associated with Cyp7b,
particularly DHEA/7α-hydroxy-DHEA (7-HD), pregnenolone/7α-hydroxy-pregnenolone
(7-HP) and β-estradiol/7α-hydroxy-β-estradiol (7-HE). They have also determined that
DHEA concentration in brain tissue declines with age, whereas the concentrations of other
brain steroids do not, and determined that the ageing process may be associated with
deficits in certain steroids and also with deficits in the concentration of Cyp7b itself. It is
also believed that one of the products produced by Cyp7b mediated reactions, namely
7α-hydroxy dehydroepiandrosterone, plays an important role in the operation of the
immune system. Because 7α-hydroxy-DHEA is believed to be made substantially only in
the brain, the inventors hypothesize that senescence may be due to a deficit in
brain-produced 7α-hydroxy-DHEA as well as in other steroids found in the brain such as
DHEA, pregnenolone and 7α-hydroxy-pregnenolone.

The present inventors have now further determined that one of the specific
properties of the 7α-hydroxy-substituted steroids, and potentially their 7-oxo substituted
steroid derivatives, provided by the present invention is that of glucocorticoid and/or
mineralocorticoid antagonism, whether at receptor level or otherwise. This is particularly
demonstrated by Example 5 below with respect to 7α-hydroxy-DHEA but is more
generally applicable. Thus this activity not only gives further uses for the novel steroids
of the invention but provides first and second medical uses for known 7α-hydroxy or 7-oxo
steroids made available by the present process as glucocorticoid and/or mineralocorticoid
antagonists and preferably in antagonism specific to neuronal tissue such as in the CNS.

Thus, having regard to this activity and their involvement in endogenous metabolic
pathways, particularly in the brain, the 7α-hydroxy substituted 3β-hydroxy-steroids
provided by use of the Cyp7b enzyme activity, including novel compounds provided by the
invention and their 7-oxo derivatives, have utility in the therapy of neuropsychiatric,
immune and endocrine disorders, particularly but not exclusively steroid-associated
disorders.

Use of these 7α-hydroxy or 7-oxo substituted 3β-hydroxy-steroids, preferably
possessing the carbon skeleton of cholesterol, androsterone, pregnenolone or estradiol, or
derivatives thereof substituted independently at one or both of the 7- and 3-positions with
an ester or ether group, in treating these disorders and for manufacturing medicaments for
such treatment is provided in a first aspect of the present invention. Particularly preferred
derivatives are those wherein one or both of the ester and/or ether groups is metabolisable
in vivo to produce the corresponding hydroxy compound.

Preferred derivatives include those wherein the steroid has a 3β-substituent -OR1
and/or a 7α-substituent -OR2, where -OR1 and -OR2 each independently represents a free
hydroxy, ester or ether group,

wherein each of R1 and R2 is independently selected from the group consisting
of hydrogen, substituted or unsubstituted C1-6 alkyl groups, groups R5CO-, wherein R5
may be selected from substituted or unsubstituted C1-6 alkyl groups, and groups of the
formula -OP(OH)3, wherein any substituents are selected from OH, halogen (F, Cl, Br, I),
amino, C1-6 alkylamino, C1-6 dialkylamino, COOH or COOR4, wherein R4 represents a
C1-6 alkyl group; and wherein the compounds may be in free form or in the form of acid
addition salts with pharmacologically acceptable anions.

The particular disorders for which this utility is provided include

(a) deficits of cognition in aging
(b) Alzheimer's disease
(c) deficits of immune system in aging
(d) deficits of immune function in HIV infection
(e) glucocorticoid or mineralocorticoid excess
(f) diabetes
(g) depression
(h) osteoporosis and hypercalcemia
(i) hyperglycemia and hyperlipidemia
(j) muscle atrophy
(k) arteriosclerosis
(l) steroid diabetes

Further, these 7α-hydroxy steroids, their esters, ethers and 7-oxo derivatives may
be used to induce cognitive enhancement in a normal individual.

Preferred steroids for such use have the carbon skeleton of androsterone,
pregnenolone or estradiol and particularly preferred examples are 7α-hydroxy-DHEA and
7α-hydroxypregnenolone. Accordingly the present invention further provides the use of
novel compounds of Formula Ia and Ib shown below in the applications indicated above.

Particularly preferred uses for the antagonistic properties of these 7-substituted
steroids include treatment of disorders falling within category (e) above or where reversal
of the effects of such corticoids, regardless of excess, is required.

A second aspect of the present invention provides pharmaceutical compositions
implementing such use. The compositions in which the novel steroids and known steroids
of the invention will be used will readily occur to those skilled in the art, generally
comprising the steroid active in association with a pharmaceutically acceptable carrier or
diluent, with formulations for example being suitable for inhalation or for gastrointestinal
(e.g. oral), parenteral, topical, transdermal or transmural administration.

As an alternative to administering the compounds of the invention per se, a third
aspect of the invention provides the possibility of using the gene sequences of the Cyp7b
genes in gene therapy in order to compensate for a deficiency in Cyp7b enzyme. In such
therapies, constructs comprising Cyp7b coding sequences can be packaged in conventional
delivery systems, such as adenoviruses, vaccinia viruses, herpes viruses and liposomes,
and administered via a route which results in preferential targeting of a selected tissue,
especially the brain. The invention further provides the possibility of using the gene
sequences of the Cyp7b genes in gene therapy in order to achieve the endogenous
expression of Cyp7b sequences for other purposes, e.g. in order to promote immunogenic
processes. Thus for example, a vector such as a suitably modified vaccinia virus
(or variant thereof) may be co-administered with a vaccine formulation so that the
expressed Cyp7b sequences augment the immunogenic properties of the vaccine.

It will be realised that in the event of Cyp7b-related disorders other than those
involving its depletion it may be desirable to use vectors containing antisense sequences
to Cyp7b effective such as to inhibit Cyp7b expression.

Macromolecules related immunologically to Cyp7b enzymes form fourth and fifth
aspects of the invention and in this regard antibodies, particularly monoclonal antibodies
which are capable of selectively binding Cyp7b, have utility in the diagnosis of disorders
(a) to (l) referred to above. Anti-Cyp7b antibodies (including monoclonal antibodies) as
well as binding molecules comprising antibody fragments may be produced by known
methods and used in test kits for assays for Cyp7b enzymes.

According to a sixth aspect of the invention, there is provided a process of
producing a 7α-hydroxy-substituted steroid which comprises subjecting a corresponding
steroid substrate having no hydroxy substituent in the 7-position to hydroxylation in the
presence of a Cyp7b steroid hydroxylase enzyme.

The Cyp7b steroid hydroxylase enzyme used in the process of the invention is
preferably a Cyp7b enzyme described and claimed in the above-mentioned International
Patent Application No PCT/GB95/02465 (and referred to therein as Hct-1). Such enzymes
include (a) ones having the precise amino acid sequences described for mouse, rat and
human Cyp7b, (b) homologous enzymes from other species and (c) enzymes having amino
acid sequences which differ from the sequences of enzymes included in definitions (a) and
(b), but in which the capacity to catalyse the introduction of a 7α-hydroxyl group is not
eliminated.

The amino acid sequence of suitable Cyp7b steroid hydroxylase enzymes may be
defined in terms of the DNA coding sequences disclosed in International Patent Application
No PCT/GB95/02465. Thus the Cyp7b steroid hydroxylase enzyme may have a sequence
encoded by DNA coding sequences of Cyp7b enzymes selected from

(a) Coding sequences of DNA molecules comprising the coding
    sequence for rat Cyp7b set forth in SEQ Id No: 1,

(b) Coding sequences of DNA molecules comprising the coding
    sequence for mouse Cyp7b set forth in SEQ Id No: 2,

(c) Cyp7b steroid hydroxylase-encoding DNA molecules capable of
    hybridizing with the DNA molecule defined in (a) or (b) under
    standard hybridization conditions defined as 2 x SSC at 65°C,

(d) Cyp7b steroid hydroxylase-encoding DNA molecules capable of
    hybridizing with the DNA molecule defined in (a), (b) or (c) under
    reduced stringency hybridization conditions defined as 6 x SSC at
    55°C.

The sequences (a) and (b) above represent rat and mouse Hct-1 gene sequences.
Homologous sequences from other vertebrate species, especially mammalian species
(including man) fall within the class of DNA molecules represented by (c) or (d).

Thus for human Cyp7b, the steroid hydroxylase enzyme may comprise a
sequence encoded by

(e) DNA coding sequences selected from the following:
    (i) the sequence designated "exon 3" in SEQ Id No 3,
    (ii) the sequence designated "exon 4" in SEQ Id No 3, and

(f) Cyp7b steroid hydroxylase-encoding DNA molecules capable of
    hybridizing with the DNA molecules defined in (e) under standard
    hybridization conditions defined as 2 x SSC at 65°C.

(g) Cyp7b steroid hydroxylase-encoding DNA molecules capable of
    hybridizing with the DNA molecule defined in (e) or (f) under
    reduced stringency hybridization conditions defined as 6 x SSC at
    55°C.

(h) Cyp7b steroid hydroxylase-encoding DNA molecules comprising
    contiguous pairs of sequences selected from
    (i) the sequence designated "exon 3" in SEQ Id No 3,
    (ii) the sequence designated "exon 4" in SEQ Id No 3, and

(i) Cyp7b steroid hydroxylase-encoding DNA molecules capable of
    hybridizing with the DNA molecules defined in (h) under standard
    hybridization conditions defined as 2 x SSC at 65°C.

(j) Cyp7b steroid hydroxylase-encoding DNA molecules capable of
    hybridizing with the DNA molecule defined in (h) or (i) under
    reduced stringency hybridization conditions defined as 6 x SSC at
    55°C.

(k) Coding sequences of DNA molecules comprising a contiguous
    coding sequence consisting of the sequences "exon 3" and "exon
    4" in SEQ Id No 3, and

(l) Cyp7b steroid hydroxylase-encoding DNA molecules capable of
    hybridizing with the DNA molecules defined in (k) under standard
    hybridization conditions defined as 2 x SSC at 65°C.

(m) Cyp7b steroid hydroxylase-encoding DNA molecules capable of
    hybridizing with the DNA molecule defined in (k) or (l) under
    reduced stringency hybridization conditions defined as 6 x SSC at
    55°C.
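
For orientation, the two hybridization conditions recited in the definitions above can be expressed as salt concentrations. The following is a minimal sketch only: it assumes the conventional laboratory definition of 1 x SSC as 0.15 M sodium chloride with 0.015 M trisodium citrate, which this specification does not itself state, and the names `ssc_molarity`, `CONDITIONS` and `describe` are illustrative.

```python
# Conventional 1 x SSC composition. This is an assumption drawn from common
# laboratory practice; the specification gives only the fold-concentration.
SSC_1X_NACL_M = 0.15
SSC_1X_CITRATE_M = 0.015

# The two conditions named in the text: fold-concentration of SSC and temperature.
CONDITIONS = {
    "standard": {"ssc_fold": 2, "temp_c": 65},
    "reduced_stringency": {"ssc_fold": 6, "temp_c": 55},
}


def ssc_molarity(fold):
    """Return (NaCl, citrate) molarity for an n-fold SSC solution."""
    return (fold * SSC_1X_NACL_M, fold * SSC_1X_CITRATE_M)


def describe(name):
    """Format one named condition as a human-readable line."""
    cond = CONDITIONS[name]
    nacl, citrate = ssc_molarity(cond["ssc_fold"])
    return (
        f"{name}: {cond['ssc_fold']} x SSC "
        f"({nacl:.2f} M NaCl, {citrate:.3f} M citrate) at {cond['temp_c']} C"
    )


if __name__ == "__main__":
    for condition_name in CONDITIONS:
        print(describe(condition_name))
```

Higher salt and lower temperature both relax the stringency of hybridization, which is why the 6 x SSC, 55°C condition admits more divergent sequences than the 2 x SSC, 65°C condition.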



It will be appreciated that the DNA sequences referred to may consist of or be
derived from genomic DNA, but typically would consist of or be derived from cDNA.
Such sequences could be obtained by probing an appropriate library (cDNA or genomic)
using hybridisation probes based upon the sequences provided according to the invention
of International Patent Application No PCT/GB95/02465, or they could be prepared by
chemical synthesis or by ligation of sub-sequences.

In the above definitions, Cyp7b steroid hydroxylases have been defined in terms of
DNA sequence information. The Cyp7b steroid hydroxylase enzyme used in accordance
with the process of the invention may alternatively or additionally be defined by reference
to amino acid sequence information, e.g. the amino acid sequences contained in SEQ ID
NO. 4, SEQ ID NO. 5 or SEQ ID NO. 6.

Thus the Cyp7b steroid hydroxylase enzyme used in accordance with the process
of the invention may have sequences matching one of said sequences exactly, or
alternatively, the enzymes used may have sequences which differ from the aforementioned
sequences, provided that the capacity to catalyse the introduction of a 7α-hydroxyl group
is not eliminated.

Thus, for example, mutant enzymes may be produced by known methods, for
example site-directed mutagenesis or other PCR-based procedures, and the expression
products tested for their capacity to catalyse the introduction of a 7α-hydroxyl group in
selected substrates in accordance with the procedures described herein.

Having regard to the degree of homology between the rat, mouse and human
enzymes and known data relating to species divergence of hydroxylase enzymes, it is
preferred that by comparison with the DNA sequences of SEQ ID NO. 1, SEQ ID NO. 2
and SEQ ID NO. 3, the mutant enzymes should be encoded by sequences having at
least 50% homology, more preferably at least 60% homology and most preferably at least
70% homology with said sequences over a length of 50 contiguous nucleotides.

Preferably the mutant enzymes are encoded by sequences having at least 60%
homology with the entire coding sequence, more preferably at least 70%.

Alternatively, by comparison with the amino acid sequences of SEQ ID NO. 4,
SEQ ID NO. 5 and SEQ ID NO. 6, it is preferred that mutant enzymes should have at
least 50% homology, more preferably at least 60% homology and most preferably at least
70% homology with said sequences over a length of 30 contiguous amino acids. Preferably
the mutant enzymes have at least 60% homology and more preferably 70% homology or
more with the entire amino acid sequence in each case.
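
The windowed thresholds above (50 contiguous nucleotides, 30 contiguous amino acids) can be illustrated with a short calculation. The sketch below is hedged in two respects: it treats "homology" as simple percent identity, and it assumes the two sequences have already been aligned without gaps to the same length. Neither assumption is stated in the specification, and the function names are hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percentage of positions at which two equal-length sequences match."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not seq_a:
        raise ValueError("sequences must be non-empty")
    matches = sum(1 for x, y in zip(seq_a, seq_b) if x == y)
    return 100.0 * matches / len(seq_a)


def best_window_identity(seq_a, seq_b, window):
    """Highest percent identity over any contiguous window of the given length."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if window <= 0 or window > len(seq_a):
        raise ValueError("window must be between 1 and the sequence length")
    return max(
        percent_identity(seq_a[i:i + window], seq_b[i:i + window])
        for i in range(len(seq_a) - window + 1)
    )


def meets_threshold(seq_a, seq_b, window, threshold=50.0):
    """True if some contiguous window reaches the stated percentage."""
    return best_window_identity(seq_a, seq_b, window) >= threshold


if __name__ == "__main__":
    reference = "ACGT" * 15                    # 60-nucleotide toy sequence
    candidate = "ACGA" * 15                    # differs at every fourth base
    print(best_window_identity(reference, candidate, 50))
    print(meets_threshold(reference, candidate, 50, threshold=70.0))
```

With window=50 the check corresponds to the nucleotide comparison and with window=30 to the amino acid comparison; passing the full sequence length as the window gives the whole-sequence figure.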

It is however preferred that such mutant enzymes do not differ too drastically from
the aforementioned sequences and in this regard, where amino acid substitutions are made,
the substitutions are preferably so-called "synonymous" or "conservative"
substitutions, i.e. hydrophilic, hydrophobic, basic and acidic amino acids should preferably
be substituted by amino acids in the same class (see US 5380712).

More specifically, it is preferred that the mutant enzymes differ from the precise
sequences of those described herein by not more than 20, preferably not more than 10 and
most preferably not more than 5 amino acid substitutions, insertions or deletions.
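
The notion of a conservative substitution and the preferred limits on the number of changes can be sketched as follows. The assignment of residues to the four classes below is one common illustrative grouping and is an assumption; the specification refers to US 5380712 for the classes and does not list them. The sketch handles substitutions only, on equal-length sequences, and all names are hypothetical.

```python
# Illustrative grouping of the 20 standard amino acids (one-letter codes).
# This grouping is an assumption, not taken from the specification.
_CLASSES = {
    "acidic": "DE",
    "basic": "KRH",
    "hydrophobic": "AVLIMFWPG",
    "hydrophilic": "STNQCY",
}
AA_CLASS = {res: name for name, members in _CLASSES.items() for res in members}


def classify_substitutions(reference, mutant):
    """Return (conservative, non_conservative) substitution counts.

    Both sequences must have the same length; insertions and deletions
    are outside the scope of this sketch.
    """
    if len(reference) != len(mutant):
        raise ValueError("sequences must have equal length")
    conservative = non_conservative = 0
    for ref_res, mut_res in zip(reference, mutant):
        if ref_res == mut_res:
            continue
        if AA_CLASS[ref_res] == AA_CLASS[mut_res]:
            conservative += 1
        else:
            non_conservative += 1
    return conservative, non_conservative


def preference_tier(n_changes):
    """Map a count of changes onto the limits of 5, 10 and 20 given in the text."""
    if n_changes <= 5:
        return "most preferred"
    if n_changes <= 10:
        return "preferred"
    if n_changes <= 20:
        return "within limit"
    return "outside preferred range"


if __name__ == "__main__":
    cons, non_cons = classify_substitutions("MKDLS", "MRDLA")
    print(cons, non_cons, preference_tier(cons + non_cons))
```

A lookup table keeps the class definitions in one place, so a different grouping can be substituted without touching the counting logic.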

The Cyp7b enzymes described herein may be used in toxicological and drug
evaluation studies and such uses form further aspects of the invention. In a particularly
preferred embodiment of this aspect of the invention, a cell line capable of expressing a
Cyp7b enzyme is used as a basis of an assay for one or more Cyp7b substrates. Such cell
lines have utility in toxicological and drug evaluation studies. Most preferably the cell line
comprises a prokaryotic or eucaryotic cell line which has been transformed so as artificially
to express a Cyp7b enzyme. Examples include bacteria, yeast and mammalian cells. Also
included are transgenic animals, at least one tissue of which (especially a non-brain tissue)
expresses Cyp7b enzyme. Such transgenic animals may be produced by known methods
for introducing foreign coding sequences into somatic or germ line cells.

The substrates used in the method of the invention are characterised by possessing
a 3β-hydroxyl group and further by preferably possessing the carbon skeleton of
cholesterol, androsterone, pregnenolone or estradiol, with the proviso that where the
substrate has the carbon skeleton of cholesterol, the substrate has a hydroxyl group in the
25, 26 or 27-position, preferably the 25-position.

Examples of such substrates include 25-hydroxycholesterol,
dehydroepiandrosterone, pregnenolone and estradiol, in which case the steroids produced
will be 7α-hydroxy-25-hydroxycholesterol, 7α-hydroxydehydroepiandrosterone,
7α-hydroxypregnenolone and 7α-hydroxyestradiol (i.e. estra-1,3,5(10)-triene-3,7α,17β-triol)
respectively.

The 7α-hydroxylated steroid produced according to the invention may be oxidised
by known enzymatic or non-enzymatic procedures to produce 7-oxo substituted steroids
and this further process step forms a further aspect of the invention.

Certain 7α-hydroxy-substituted steroids produced according to the invention and
certain corresponding 7-oxo derivatives are novel and provide a further aspect of the
invention. Thus the present invention further provides novel 3β-hydroxy steroids
characterised in that they have a 7α-hydroxy or 7-oxo substituent. Preferred novel
steroids have the carbon skeleton of cholesterol, androsterone, pregnenolone or estradiol,
with the proviso that where the skeleton is that of cholesterol, the 25, 26 or 27 position is
hydroxylated, most preferably the 25 position.

Particular novel steroids are of the formula

[Structural formulae Ia and Ib, each bearing a substituent OR3]

wherein OR1, OR2 and OR3 each independently represents a free hydroxy group, an ether
group or an esterified hydroxy group.

In the case where OR1, OR2 and OR3 each independently represents an ether
group, each of R1, R2 and R3 may be selected from substituted or unsubstituted C1-6 alkyl
groups, any such substituents being selected from OH, halogen (F, Cl, Br, I), amino, C1-6
alkylamino, C1-6 dialkylamino, COOH or COOR4 wherein R4 represents a C1-6 alkyl
group which may be unsubstituted or substituted by one of the substituents referred to
above.

In the case where OR1, OR2 and OR3 each independently represents an esterified
hydroxy group, each of R1, R2 and R3 may have the formula R5CO-, wherein R5 may be
selected from substituted or unsubstituted C1-6 alkyl groups, any such substituents being
selected from OH, halogen (F, Cl, Br, I), amino, C1-6 alkylamino, C1-6 dialkylamino,
COOH or COOR4 wherein R4 represents a C1-6 alkyl group; and groups of the formula
-OP(OH)3. Where compounds of Formula Ia or Ib include substituents such as carboxyl
groups, phosphate groups, or substituted or unsubstituted amino groups, the compounds may
be in free form or in the form of acid addition salts with pharmacologically acceptable
anions (such as, for example, phosphate or halide ions) or cations (such as, for example,
alkali metal cations). Thus, where OR1, OR2 or OR3 represents hemisuccinate
HOOC(CH2)2CO-, the resulting hemisuccinate may be in the form of, for example, an Na
or K salt.

It will be realised that the present invention provides for 7α-hydroxylated and 7-oxo
steroids as described above but which are further substituted at other positions directly on
the steroid skeleton.

7α-Hydroxyestradiol and 7-oxoestradiol are specific examples of compounds of
Formula Ia and Ib.

The invention will now be described in more detail with particular reference to the
following Figures and Examples.



Description of Figures 

Figure 1 illustrates an autoradiogram of a TLC plate used in an experiment to assess
the ability of various cell extracts to hydroxylate DHEA.

Figure 2 depicts the ability of various tissues to release radioactivity from
7-³H-pregnenolone.

Figure 3 illustrates the principal steroid interconversions mediated by Cyp7b.

Figure 4 is a histogram plotting fold induction of luciferase expression with
concentration of various steroids as described in Example 5.

Figure 5 illustrates the attenuation of Cyp7b gene expression in Alzheimer's disease as
described in Example 5.

Figure 6 shows mass spectrometer plots of 7α-hydroxy-DHEA produced by the
present process and a reference sample thereof.

EXAMPLE 1 - Identification of substrate specificity of Mu Cyp7b

A. Preparation of vaccinia expression construct 

To identify the reaction catalysed by Cyp7b, a cDNA encoding the mouse enzyme,
reported by Lathe, Rose and Stapleton (PCT/GB95/02465) and by Stapleton et al. (J. Biol.
Chem. 270, 29739, December 15, 1995), was modified to introduce a translation
initiation consensus sequence at the 5' end of the Cyp7b open reading frame as described
therein. The modified cDNA was introduced into the genome of vaccinia virus by
recombinational exchange according to standard procedures (see, for instance, Gonzalez
et al., Meth. Enzymol. 206, 85-92, 1991 and references therein) as described in Lathe et al.

B. Production of Cyp7b enzyme extracts. 

HeLa cells were grown to semi-confluence (10^ cells per 5 cm dish; 5 ml medium)
and infected with recombinant (VV-Cyp7b) and control (VV Copenhagen strain) vaccinia
viruses at 0.1 pfu per cell; 16 hours later infected cells were washed and taken up into W
(Waxman) buffer (0.1 M KPO4, 1 mM EDTA, 20% glycerol, pH 7.5; 500 µl per plate) and
recentrifuged (5 min., 1000 rpm).

For whole cell extracts, cells were resuspended into 1/100 volume (50 µl per plate)
of W buffer and stored frozen at -70°C. For microsome preparation (Waxman, Biochem.
J. 260, 81-85, 1989) cells were resuspended in 1/10 original volume of W buffer (500 µl
per plate), sonicated 6 x 5 seconds on ice, and unbroken cells were removed by
centrifugation (10 min., 4°C, 3000 rpm).

The microsomal fraction was prepared from the supernatant by centrifugation
(100,000 g, 45 min., 4°C, Beckman SW50.1 rotor) and resuspended using a Potter
homogeniser in 1/50 original volume of W buffer (100 µl per plate) before storage
at -70°C.

Control extracts were prepared from liver and brain from male rat by homogenising
fresh tissue in W buffer (2.5 ml/g) and clarifying briefly by centrifugation (4000 rpm, 5 min,
4°C); the supernatant was stored at -70°C.

C. Substrate identification by thin-layer chromatography. 

¹⁴C- or ³H-labelled steroids were purchased from DuPont-NEN (¹⁴C-labelled
molecules: specific activities 45-60 mCi/mmol; ³H: specific activities 70-100 mCi/mmol).
1 nmol aliquots of labelled substrate were dried down, microsomes or cell and tissue
extracts were added (25 to 50 µl), and diluted to a volume of 175 µl with W buffer.

Reaction was started by the addition of 25 µl of 8 mM NADPH. After incubation
at 37°C for 15 minutes the reaction was shaken with 500 µl of ethyl acetate (BDH). The
organic phase was removed, dried down, and suspended into 10 µl ethyl acetate. Aliquots
(5 x 2 µl) were applied to thin layer chromatography (TLC) sheets (Merck) and developed
in ethyl acetate/n-hexane/acetic acid 16:8:1 (solvent system N of Waxman, Meth. Enzymol.
206, 462-476, 1991). After drying, ¹⁴C chromatograms were visualised by exposure to
X-ray film. ³H-labelled chromatograms were treated with EN³HANCE™ (DuPont-NEN)
spray prior to exposure.
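
The final concentrations in the reaction follow from the volumes given above. The sketch below is a worked calculation only; it assumes that the volumes are simply additive and that the dried substrate contributes no volume, and the function names are illustrative.

```python
# Quantities taken from the protocol text.
SUBSTRATE_NMOL = 1.0             # labelled substrate per reaction
VOLUME_BEFORE_NADPH_UL = 175.0   # extract plus W buffer
NADPH_STOCK_MM = 8.0             # NADPH stock concentration
NADPH_VOLUME_UL = 25.0           # NADPH volume added to start the reaction


def final_volume_ul():
    """Total reaction volume, assuming the volumes are additive."""
    return VOLUME_BEFORE_NADPH_UL + NADPH_VOLUME_UL


def substrate_final_um():
    """Final substrate concentration in micromolar (nmol/ul equals mM)."""
    return SUBSTRATE_NMOL / final_volume_ul() * 1000.0


def nadph_final_mm():
    """Final NADPH concentration in millimolar after dilution."""
    return NADPH_STOCK_MM * NADPH_VOLUME_UL / final_volume_ul()


def activity_nci(specific_activity_mci_per_mmol):
    """Radioactivity per reaction in nCi for a given specific activity.

    1 nmol is 1e-6 mmol and 1 mCi is 1e6 nCi, so the two factors cancel.
    """
    return SUBSTRATE_NMOL * specific_activity_mci_per_mmol


if __name__ == "__main__":
    print(final_volume_ul(), "ul total")
    print(substrate_final_um(), "uM substrate")
    print(nadph_final_mm(), "mM NADPH")
    print(activity_nci(45.0), "nCi at 45 mCi/mmol")
```

Under these assumptions each 200 µl reaction contains 5 µM substrate and 1 mM NADPH.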



D. Results 

Figure 1 is an autoradiogram of a TLC plate run in solvent system N; substrate was
³H-DHEA and samples were extracted with ethyl acetate and dried prior to application to
the TLC plate (origin at bottom of figure). Extracts were 1, Microsomes from HeLa cells
infected with control vaccinia virus (negative control); 2, Microsomes from HeLa cells
infected with VVCyp7b; 3, Duplicate preparation of microsomes from HeLa cells infected
with VVCyp7b; 4, Rat brain homogenate.

As can be seen from Figure 1, microsomes from cells infected with recombinant
vaccinia expressing Cyp7b converted labelled dehydroepiandrosterone (DHEA) to a lower
mobility form most consistent with hydroxylation. Brain extracts yielded a product of
indistinguishable mobility, consistent with our earlier demonstration that Cyp7b is
expressed in brain. From the relative mobility of the product we surmised that Cyp7b could
be hydroxylating DHEA at the 7 position. Progesterone, corticosterone, cortisol and
testosterone were at best inefficiently metabolised, if at all. However, pregnenolone and
estradiol were both converted by the enzymes, as was 25-hydroxycholesterol. All these
substrates are distinguished by a 3β-hydroxy group.

EXAMPLE 2 - Identification of the position of the modification by 3H-release.

To identify the position of the modification, 3H-pregnenolone (NEN) was employed
in which the 3H substitution was predominantly at the 7 position on the steroid backbone.
Microsomal extracts were incubated with 3H-pregnenolone under the same conditions as
used earlier. Following reaction, labelled steroids were extracted with ethyl acetate (2 x 1
ml), and discarded; release of 3H into the aqueous phase was monitored by liquid
scintillation counting.

Referring to Figure 2, 7-3H-pregnenolone was incubated with extracts and assayed
for release of radioactivity into the aqueous phase following extraction with ethyl acetate.
Extracts were: 1, Microsomes from HeLa cells infected with control vaccinia virus (negative
control); 2, Microsomes from HeLa cells infected with VVCyp7b; 3, Duplicate preparation
of microsomes from HeLa cells infected with VVCyp7b; 4, Rat brain homogenate; 5, Rat
liver homogenate.

As seen in Figure 2, microsomes from cells infected with recombinant
vaccinia expressing Cyp7b efficiently released 3H into the aqueous phase. Brain also
performed this reaction but not liver. Release of 3H from the 7 position of pregnenolone
demonstrates that Cyp7b hydroxylates pregnenolone at the 7-position to generate
7-hydroxy pregnenolone (7HP); it may be concluded that Cyp7b also hydroxylates DHEA
(to generate 7-hydroxy DHEA [7HD]) and estradiol (to generate 7-hydroxy estradiol [7HE]).

EXAMPLE 3 - Stereochemistry of the Cyp7b hydroxylation.

Steroids hydroxylated at a variety of positions (e.g. 2, 6, 7, 15, 16) differ in their
mobility on TLC depending on whether the modification is in the α- or the β-position
(Waxman, Meth. Enzymol. 206, 462-476, 1991). Purified 7α-hydroxy DHEA was
obtained (kind gift of Dr. H. A. Lardy, Enzyme Institute, University of Wisconsin), mixed
with the product of Cyp7b action on DHEA, and subjected to TLC. The product
co-migrated with 7α-hydroxy-DHEA, demonstrating that Cyp7b is a 7α-hydroxylase.



EXAMPLE 4 - Activity of enzyme in 7α-hydroxylation of pregnenolone and DHEA

To examine the catalytic activity of the enzyme, Cyp7b cDNAs were expressed in
mammalian cell lines. Cell extracts showed substantial NADPH-dependent conversion of
DHEA (Km 13.3 uM; Vmax 288 pmol/min/mg) and pregnenolone (Km 3.6 uM; Vmax 34
pmol/min/mg) to slower migrating forms on thin layer chromatography. Products of
identical mobility were generated by rat brain extracts. The expressed enzyme was less
active against 25-hydroxycholesterol, 17β-estradiol and 5α-androstane-3β,17β-diol, with
low to undetectable activity against progesterone, corticosterone and testosterone. When
[3H-7α]pregnenolone was incubated with Cyp7b extracts the extent of release of
radioactivity into the medium suggested that hydroxylation was preferentially at the 7α-
position. In gas chromatography and mass spectrometry of the modified steroid arising
from incubation of DHEA with Cyp7b extracts, the retention time and fragmentation
patterns were identical to those obtained with authentic 7α-hydroxy DHEA (7HD); the
reaction product also co-migrated with 7HD on TLC.
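The reported Km and Vmax values can be compared using the Michaelis-Menten rate law. The following Python sketch is illustrative only: it assumes simple Michaelis-Menten behaviour, which the text does not state explicitly, and the 5 uM substrate concentration and all names are hypothetical choices.

```python
# Illustrative Michaelis-Menten comparison using the Km/Vmax values quoted above.
# Assumption: simple single-substrate Michaelis-Menten kinetics apply.

def michaelis_menten(s_uM, km_uM, vmax):
    """Initial rate (same units as vmax) at substrate concentration s_uM."""
    return vmax * s_uM / (km_uM + s_uM)

# (Km in uM, Vmax in pmol/min/mg) as reported for the expressed enzyme.
KINETICS = {
    "DHEA": (13.3, 288.0),
    "pregnenolone": (3.6, 34.0),
}

substrate_uM = 5.0  # hypothetical substrate concentration for comparison

for name, (km, vmax) in KINETICS.items():
    rate = michaelis_menten(substrate_uM, km, vmax)
    efficiency = vmax / km  # Vmax/Km, a rough measure of efficiency at low [S]
    print(f"{name}: rate {rate:.1f} pmol/min/mg, Vmax/Km {efficiency:.1f}")
```

On these figures DHEA is turned over faster than pregnenolone at the same concentration, although pregnenolone has the lower Km.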

Mass spectrometry: A 10x scaled up reaction was employed using 95% unlabelled
DHEA (Sigma) and 5% [14C]-DHEA (final specific activity 2.25-3 mCi/mmol) and
reaction time was extended to 1 hour. Product was purified by TLC, excised and extracted
with ethyl acetate before drying down. The dried residue and authentic 7HD (50mg) were
converted to their methoxime-trimethylsilyl (MO-TMS) derivatives. Analysis of these
products was performed using a Trio 1000 mass spectrometer operating in electron impact
(EI) mode, linked to a HP5890 gas chromatograph fitted with a HP-1 cross-linked methyl
siloxane column (25 m, i.d. 0.25 mm, 0.17 um film) under the following conditions:
electron energy 70 eV, source temperature 200°C, interface temperature 280°C, oven
temperature 50°C increasing at 30°C per minute to 200°C, and then 10°C per minute to
300°C, injection temperature 280°C.
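The duration of the oven temperature programme follows from the stated ramp rates. The following Python sketch is illustrative only; it assumes no isothermal hold times, since none are given in the text, and the names are hypothetical.

```python
# Illustrative calculation of the GC oven programme duration.
# Assumption: no isothermal holds before, between or after the two ramps.

def ramp_minutes(start_c, end_c, rate_c_per_min):
    """Time in minutes to ramp linearly from start_c to end_c."""
    return (end_c - start_c) / rate_c_per_min

# (start temperature, end temperature, rate in degrees C per minute)
OVEN_PROGRAMME = [
    (50, 200, 30),
    (200, 300, 10),
]

segment_minutes = [ramp_minutes(*segment) for segment in OVEN_PROGRAMME]
total_minutes = sum(segment_minutes)

print(segment_minutes, total_minutes)
```

Under this assumption the two ramps take 5 and 10 minutes respectively, 15 minutes in total.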

EXAMPLE 5 - Cis-trans co-transfection assay; demonstration of antagonism. 

Chinese hamster ovary (CHO) cells were maintained and transfected in Dulbecco's
modification of Eagle's medium (DMEM) supplemented with 15% foetal bovine serum,
100 IU/ml penicillin, 100 ug/ml streptomycin and 200 mM L-glutamine (all Gibco BRL,
Paisley, UK).

24 hours prior to transfection CHO cells were plated at a density of 3 x 10^/60 mm
dish (Costar UK). Cells were transfected by the calcium phosphate method. Briefly, 5 ug
of MMTV-LUC and 1 ug of pRShGR or 5 ug of pSV2 as a control for transfection
efficiency were made up to a total of 10 ug/plate of DNA with pGEM3. 30 ul of 2.5 M
CaCl2 was diluted ten-fold with sterile water and 300 ul was added to the DNA. Next 300 ul
of 2 x Hepes buffered saline (280 mM NaCl, 10 mM KCl, 1.5 mM Na2HPO4.2H2O, 50 mM
Hepes, 12 mM dextrose, pH 7.05) was added slowly with swirling to the DNA/CaCl2
mixture. This solution was left for 30 minutes in order for a fine precipitate to form and
600 ul was added dropwise to each plate. After 24 hours the medium was removed and the
cells were washed in serum free medium and cultured for a further 24 hours in medium
containing 10% charcoal-stripped serum together with the appropriate concentrations of
DHEA/7α-hydroxy-DHEA.
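The composition of the transfection mixture follows from the dilutions described. The following Python sketch is illustrative only; it assumes that the DNA contributes negligible volume, and the names are hypothetical.

```python
# Illustrative composition of the calcium phosphate transfection mixture.
# Assumption: the volume of the DNA itself is negligible.

cacl2_stock_M = 2.5
cacl2_diluted_M = cacl2_stock_M / 10  # ten-fold dilution in sterile water
cacl2_vol_ul = 300.0                  # diluted CaCl2 added to the DNA
hbs_vol_ul = 300.0                    # 2 x Hepes buffered saline added next

total_vol_ul = cacl2_vol_ul + hbs_vol_ul
cacl2_final_M = cacl2_diluted_M * cacl2_vol_ul / total_vol_ul

# 2 x HBS components (mM), as listed in the text.
HBS_2X_mM = {"NaCl": 280.0, "KCl": 10.0, "Na2HPO4": 1.5, "Hepes": 50.0, "dextrose": 12.0}
hbs_final_mM = {name: conc * hbs_vol_ul / total_vol_ul for name, conc in HBS_2X_mM.items()}

dna_ug_per_ul = 10.0 / total_vol_ul  # 10 ug DNA per plate in the final mixture

print(total_vol_ul, cacl2_final_M, hbs_final_mM, dna_ug_per_ul)
```

The result is 600 ul per plate containing 0.125 M CaCl2 in 1 x Hepes buffered saline, consistent with the 600 ul added dropwise to each plate.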

Six hours after the addition of DHEA/7α-hydroxy-DHEA either corticosterone (B) or
dexamethasone (Dex) was added to each plate. The following day the cells were washed in
PBS, lysed with 0.3 ml of lysis buffer (25 mM Tris-phosphate pH 7.8, 2 mM DTT, 1% Triton
X-100 and 10% glycerol), scraped, centrifuged and the supernatant assayed in duplicate in a
Berthold luminometer in a total volume of 250 ul, comprising 40 ul of cell extract, 5 ul of
30 mM ATP, 100 ul of assay buffer (20 mM tricine, 1.07 mM (MgCO3)4.Mg(OH)2.5H2O,
2.67 mM MgSO4, 0.1 mM EDTA, 33.3 mM DTT, 0.2 mg/ml coenzyme A) and 105 ul luciferin
(Promega UK) injected to initiate the reaction. Light emission was measured over 10 seconds
and relative light units/microgram of protein was calculated.
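The assay volumes and the normalisation to protein can be expressed as a short calculation. The following Python sketch is illustrative only; the light-unit readings and protein amounts are hypothetical values chosen for the example, and the function names are not taken from any instrument software.

```python
# Illustrative luciferase assay arithmetic; readings below are hypothetical.

# Reagent volumes (ul) per luminometer assay, as listed in the text.
ASSAY_VOLUMES_UL = {
    "cell extract": 40,
    "30 mM ATP": 5,
    "assay buffer": 100,
    "luciferin": 105,
}

def rlu_per_ug(rlu, protein_ug):
    """Relative light units normalised to micrograms of protein assayed."""
    if protein_ug <= 0:
        raise ValueError("protein amount must be positive")
    return rlu / protein_ug

def fold_induction(treated, control):
    """Normalised signal of a treated plate relative to the control plate."""
    return treated / control

total_volume_ul = sum(ASSAY_VOLUMES_UL.values())
atp_final_mM = 30 * ASSAY_VOLUMES_UL["30 mM ATP"] / total_volume_ul

control = rlu_per_ug(1200.0, 8.0)  # hypothetical control reading
treated = rlu_per_ug(9000.0, 7.5)  # hypothetical steroid-treated reading

print(total_volume_ul, atp_final_mM, fold_induction(treated, control))
```

The listed volumes sum to the stated 250 ul, giving a final ATP concentration of 0.6 mM.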

Results are shown in Figure 4 wherein the fold induction of luciferase is illustrated
by histogram for control, additions of DHEA, 7α-hydroxy-DHEA (7HD) alone and these
additions in presence of a GR-activating concentration of corticosterone. This result shows
that 7HD, but not DHEA, acts as an antagonist of the corticosterone effect in activating
GR-mediated transcription.

EXAMPLE 6 - Cyp7b expression in Alzheimer's neurons

Cryostat brain sections (10 um) from control and Alzheimer's hippocampus were
cut, thaw mounted onto gelatine-subbed poly-L-lysine coated slides and stored at -80°C.
For in-situ hybridization studies, brain sections were post-fixed in 4%
paraformaldehyde, followed by acetylation (0.25% acetic anhydride in 0.1 M triethanolamine,
pH 8.0) for 10 minutes, rinsed in phosphate buffered saline, dehydrated through graded
alcohols and air dried. Hybridization was carried out using 200 ul of [35S]-UTP-labelled cRNA
antisense probe (10 x 10^6 dpm/ml in hybridization buffer) synthesized in vitro from a 500
bp XbaI/PstI fragment of the human Cyp7b pMMCtl clone linearised with XbaI and
transcribed with T3 RNA polymerase for sense probes. Sections were prehybridized with
20 ul of prehybridization buffer (as hybridization buffer but omitting the dextran sulphate)
per slide at 50°C for 3 hours.

Following hybridization with probe at 50°C overnight, sections were treated with
RNase A (30 ug/ml, 45 minutes at 37°C) and washed to a final stringency of 0.1 x SSC at
60°C. Slides were dehydrated, dipped in photographic emulsion (NTB-2, Kodak) and
exposed at 4°C for 5 weeks before being developed and counterstained with 1% pyronin.
The density of silver grains was assessed over individual hippocampal neurons by
computer-assisted grain counting using an image analysis system (Seescan plc, Cambridge,
UK), with the analysis carried out blind (sections were cut and coded by a separate
individual). For each slide, one hippocampal section represents each subject. 6-10
neurons/subregion were assessed and background, counted over areas of white matter, was
subtracted. Data were assessed by ANOVA followed by Scheffe post hoc test. Significance
was set at p<0.05. Values are means ± S.E.M.
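The background subtraction and the calculation of mean ± S.E.M. can be sketched as follows. This Python example is illustrative only: the grain counts are hypothetical, the names are not taken from the image analysis system used, and the ANOVA and Scheffe post hoc test are not reproduced here.

```python
# Illustrative background subtraction and mean +/- S.E.M. for grain counts.
# The counts below are hypothetical and serve only to show the arithmetic.
from math import sqrt
from statistics import mean, stdev

def background_corrected(neuron_counts, background):
    """Subtract the white-matter background count from each neuron count."""
    return [count - background for count in neuron_counts]

def mean_and_sem(values):
    """Return (mean, standard error of the mean) for two or more values."""
    if len(values) < 2:
        raise ValueError("at least two values are needed for the S.E.M.")
    return mean(values), stdev(values) / sqrt(len(values))

neuron_counts = [10, 12, 14]  # hypothetical grains per neuron in one subregion
background = 2                # hypothetical white-matter background count

corrected = background_corrected(neuron_counts, background)
average, sem = mean_and_sem(corrected)

print(corrected, average, round(sem, 3))
```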
Figure 5 is a histogram showing Cyp7b expression as indicated by grain count per
neuron in the dentate gyrus, CA1 and CA3 subfields of Alzheimer's disease samples as
compared to the age matched control brains.
CONCLUSIONS 

It can be concluded that Cyp7b, and cognate enzymes from rat, human and other
mammalian species, are 7α-hydroxylases specific for steroid substrates with a 3β hydroxy
group. While activities for 7-hydroxylating DHEA, pregnenolone and cholesterol have been
recorded previously in a variety of crude tissue homogenates (e.g. Akwa et al., Biochem.
J. 288, 959-964, 1992) no characterisation of the enzyme responsible was performed
previously and no activity on estradiol was recorded. Recombinant organisms expressing
Cyp7b thus provide a route to the large scale manufacture of 7HP, 7HD, and 7HE,
principally but not exclusively for therapeutic use or for the production of further steroid
derivatives such as 7-oxo molecules.
CLAIMS



1. The use of a 7α-hydroxy or 7-oxo substituted 3β-hydroxy-steroid, or a derivative
thereof substituted independently at one or both of the 7- and 3- positions with an ester or
ether group, in the manufacture of a pharmaceutical composition for the therapy of
neuropsychiatric, immune and/or endocrine disorders or for inducing cognitive
enhancement.

2. The use according to Claim 1 wherein said disorders are selected from
(a) deficits of cognition in aging
(b) Alzheimer's disease
(c) deficits of immune system in aging
(d) deficits of immune function in HIV infection
(e) glucocorticoid or mineralocorticoid excess
(f) diabetes
(g) depression
(h) osteoporosis and hypercalcemia
(i) hyperglycemia and hyperlipidemia
(j) muscle atrophy
(k) arteriosclerosis
(l) steroid diabetes

3. The use as claimed in claim 1 or claim 2 wherein the steroid has a 3β-substituent
-OR1 and/or a 7α-substituent -OR2 where -OR1 and -OR2 each independently represents a
free hydroxy, ester or ether group,
wherein each of R1 and R2 are independently selected from the group consisting
of hydrogen, substituted or unsubstituted alkyl groups, groups R5CO-, wherein R5
may be selected from substituted or unsubstituted C1-6 alkyl groups, and groups of the
formula -OP(OH)3, wherein any substituents are selected from OH, halogen (F, Cl, Br, I),
amino, C1-6 alkylamino, C1-6 dialkylamino, COOH or COOR4 wherein R4 represents a
C1-6 alkyl group; and wherein the compounds may be in free form or in the form of acid
addition salts with pharmacologically acceptable anions.

4. The use as claimed in any one of claims 1 to 3 characterised in that the steroid is
one possessing the carbon skeleton of cholesterol, androsterone, pregnenolone or estradiol.

5. The use of a Cyp7b steroid hydroxylase enzyme in the manufacture of a test kit for
use in the diagnosis of neuropsychiatric, immune and endocrine disorders.

6. The use according to Claim 5 wherein said disorders are selected from
(a) deficits of cognition in aging
(b) Alzheimer's disease
(c) deficits of immune system in aging
(d) deficits of immune function in HIV infection
(e) glucocorticoid or mineralocorticoid excess
(f) diabetes
(g) depression
(h) osteoporosis and hypercalcemia
(i) hyperglycemia and hyperlipidemia
(j) muscle atrophy
(k) arteriosclerosis
(l) steroid diabetes



7. An antibody, especially a monoclonal antibody, characterised by specifically 
binding Cyp7b enzymes. 

8. The use of an antibody as claimed in Claim 5 in a test kit for assaying for the 
presence of Cyp7b enzymes. 

9. The use of Cyp7b coding sequences or antisense sequences in the manufacture of 
a targeted drug for gene therapy of Cyp deficiencies or excesses or for promoting 
immunogenic processes. 
10. The use claimed in Claim 9 wherein a vector is co-administered with a vaccine
formulation, whereby on administration, a Cyp7b sequence is expressed and the produced 
expression product augments an immunogenic property of the vaccine. 

11. A process of producing a 7α-hydroxy-substituted steroid which comprises
subjecting a corresponding steroid substrate having no substituent in the 7-position to 
hydroxylation in the presence of a Cyp7b steroid hydroxylase enzyme. 

12. A process according to Claim 11 wherein the enzyme is a mouse, rat or human
Cyp7b steroid hydroxylase enzyme. 

13. A process according to Claim 11 wherein the Cyp7b steroid hydroxylase enzyme
has a sequence encoded by DNA coding sequences of Cyp7b enzymes selected from

(a) Coding sequences of DNA molecules comprising the coding
sequence for rat Cyp7b set forth in SEQ Id No: 1,

(b) Coding sequences of DNA molecules comprising the coding
sequence for mouse Cyp7b set forth in SEQ Id No: 2,

(c) Cyp7b steroid hydroxylase-encoding DNA molecules capable of
hybridizing with the DNA molecule defined in (a) or (b) under
standard hybridization conditions defined as 2 x SSC at 65°C.

(d) Cyp7b steroid hydroxylase-encoding DNA molecules capable of
hybridizing with the DNA molecule defined in (a), (b) or (c) under
reduced stringency hybridization conditions defined as 6 x SSC at
55°C.



14. A process according to Claim 11 wherein the Cyp7b steroid hydroxylase enzyme
has a sequence encoded by DNA coding sequences of Cyp7b enzymes selected from

(e) DNA coding sequences selected from the following:

(i) the sequence designated "exon 3" in SEQ Id No 3,

(ii) the sequence designated "exon 4" in SEQ Id No 3, and

(f) Cyp7b steroid hydroxylase-encoding DNA molecules capable of
hybridizing with the DNA molecules defined in (e) under standard
hybridization conditions defined as 2 x SSC at 65°C.

(g) Cyp7b steroid hydroxylase-encoding DNA molecules capable of
hybridizing with the DNA molecule defined in (e) or (f) under
reduced stringency hybridization conditions defined as 6 x SSC at
55°C.

(h) Cyp7b steroid hydroxylase-encoding DNA molecules comprising
contiguous pairs of sequences selected from

(i) the sequence designated "exon 3" in SEQ Id No 3,

(ii) the sequence designated "exon 4" in SEQ Id No 3, and

(i) Cyp7b steroid hydroxylase-encoding DNA molecules capable of
hybridizing with the DNA molecules defined in (h) under standard
hybridization conditions defined as 2 x SSC at 65°C.

(j) Cyp7b steroid hydroxylase-encoding DNA molecules capable of
hybridizing with the DNA molecule defined in (h) or (i) under
reduced stringency hybridization conditions defined as 6 x SSC at
55°C.

(k) Coding sequences of DNA molecules comprising a contiguous
coding sequence consisting of the sequences "exon 3" and "exon
4" in SEQ Id No 3, and

(l) Cyp7b steroid hydroxylase-encoding DNA molecules capable of
hybridizing with the DNA molecules defined in (k) under standard
hybridization conditions defined as 2 x SSC at 65°C.

(m) Cyp7b steroid hydroxylase-encoding DNA molecules capable of
hybridizing with the DNA molecule defined in (k) or (l) under
reduced stringency hybridization conditions defined as 6 x SSC at
55°C.

15. A process according to Claim 11 wherein the Cyp7b steroid hydroxylase enzyme
has a sequence encoded by DNA coding sequences of Cyp7b enzymes selected from the
amino acid sequences contained in SEQ ID NO. 4, SEQ ID NO. 5 or SEQ ID NO 6 or a
sequence which has at least 50% homology with one or more of the aforementioned
sequences, provided that the capacity to catalyse the introduction of a 7α-hydroxyl group
is not eliminated.

16. A process according to Claim 15 wherein the Cyp7b steroid hydroxylase enzyme
has a sequence encoded by a DNA coding sequence which has at least 60% homology, and
preferably at least 70% homology with one or more of the aforementioned sequences,
provided that the capacity to catalyse the introduction of a 7α-hydroxyl group is not
eliminated.

17. A process according to Claim 15 wherein the Cyp7b steroid hydroxylase enzyme
has a sequence which differs from the amino acid sequences contained in SEQ ID NO. 4,
SEQ ID NO. 5 or SEQ ID NO 6 by not more than 20, preferably not more than 10 and most
preferably not more than 5 amino acid substitutions, insertions or deletions.

18. A process according to any preceding claim wherein the substrate is a steroid
possessing a 3β-hydroxyl group.

19. A process according to any preceding claim wherein the substrate is a steroid
possessing the carbon skeleton of cholesterol, androsterone, pregnenolone or estradiol, with
the proviso that where the substrate has the carbon skeleton of cholesterol the substrate has
a hydroxyl group in the 25, 26 or 27-position.



20. A process according to Claim 19 wherein the substrate is 25-hydroxycholesterol, 
dehydroepiandrosterone, pregnenolone or estradiol. 

21. A process according to any preceding claim wherein the produced 7α-hydroxy-substituted
steroid is 7α-hydroxyestradiol, 7α-hydroxypregnenolone or
7α-hydroxydehydroepiandrosterone.

22. A process according to any preceding claim wherein produced steroid is subjected 
to an oxidation step to convert an H,OH to an oxo group. 

23. A steroid of the formula Ia or Ib

[Structural formulae Ia and Ib; labels legible in the drawings: OR1, OR2, OR3 and O]

wherein OR1, OR2 and OR3 each independently represents a free hydroxy group, an ether
group or an esterified hydroxy group.



24. A steroid according to Claim 23 wherein

each of R1, R2 and R3 may be selected from substituted or
unsubstituted C1-6 alkyl groups, any such substituents being selected from
OH, halogen (F, Cl, Br, I), amino, C1-6 alkylamino, C1-6 dialkylamino,
COOH or COOR4 wherein R4 represents a C1-6 alkyl group which may be
unsubstituted or substituted by one of the substituents referred to above; or
OR1, OR2 and OR3 each independently represents an esterified
hydroxy group of the formula R5COO-, wherein R5 may be selected from
substituted or unsubstituted C1-6 alkyl groups, any such substituents being
selected from OH, halogen (F, Cl, Br, I), amino, C1-6 alkylamino, C1-6
dialkylamino, COOH or COOR4 wherein R4 represents a C1-6 alkyl group;
or
OR1, OR2 and OR3 each independently represents an esterified
hydroxy group of formula -OP(OH)3,
or a pharmacologically acceptable salt of such a compound.

25. 7α-Hydroxyestradiol or 7-oxoestradiol.

26. A steroid as claimed in Claim 23 characterised in that it is a 3β-hydroxy steroid.

27. A process of producing an oxo-substituted steroid which comprises subjecting
7α-hydroxyestradiol, 7α-hydroxypregnenolone or 7α-hydroxydehydroepiandrosterone to
oxidation.

28. A method for treating a human or animal requiring therapy for a neuropsychiatric,
immune and endocrine disorder or for inducing cognitive enhancement comprising the
administration of an effective amount of a 7α-hydroxy or 7-oxo substituted 3β-hydroxy-
steroid or derivative thereof independently substituted at one or both of the 7- and 3-
positions by an ester or ether group.

29. A method according to Claim 28 wherein said disorders are selected from
(a) deficits of cognition in aging
(b) Alzheimer's disease
(c) deficits of immune system in aging
(d) deficits of immune function in HIV infection
(e) glucocorticoid or mineralocorticoid excess
(f) diabetes
(g) depression
(h) osteoporosis and hypercalcemia
(i) hyperglycemia and hyperlipidemia
(j) muscle atrophy
(k) arteriosclerosis
(l) steroid diabetes



30. A method as claimed in claim 28 wherein the steroid possesses the carbon skeleton
of cholesterol, androsterone, pregnenolone or estradiol and has a 3β-substituent -OR1 and/or
a 7α-substituent -OR2 where -OR1 and -OR2 each independently represents a free hydroxy,
ester or ether group,
wherein each of R1 and R2 are independently selected from the group consisting
of hydrogen, substituted or unsubstituted alkyl groups, groups R5CO-, wherein R5
may be selected from substituted or unsubstituted C1-6 alkyl groups, and groups of the
formula -OP(OH)3, wherein any substituents are selected from OH, halogen (F, Cl, Br, I),
amino, C1-6 alkylamino, C1-6 dialkylamino, COOH or COOR4 wherein R4 represents a
C1-6 alkyl group; and wherein the compounds may be in free form or in the form of acid
addition salts with pharmacologically acceptable anions.

31. A 7α-hydroxy or 7-oxo substituted 3β-hydroxy-steroid possessing the carbon
skeleton of cholesterol, androsterone, pregnenolone or estradiol, or a derivative thereof
substituted independently at one or both of the 7- and 3- positions with an ester or ether
group for use in therapy.

32. A steroid as claimed in claim 31 selected from 7α-hydroxydehydroepiandrosterone,
7α-hydroxypregnenolone and 7α-hydroxyestradiol.



33. A pharmaceutical composition characterised in that it comprises a 7α-hydroxy or
7-oxo substituted 3β-hydroxy steroid possessing the carbon skeleton of cholesterol,
androsterone, pregnenolone or estradiol, or a derivative thereof substituted independently
at one or both of the 7- and 3- positions with an ester or ether group, in association with a
pharmaceutically acceptable carrier or diluent in a sterile and pyrogen free form.
SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT:

(A) NAME: British Technology Group Ltd
(B) STREET: 101 Newington Causeway
(C) CITY: London
(E) COUNTRY: GB
(F) POSTAL CODE (ZIP): SE1 6BU

(A) NAME: Richard Frank LATHE
(B) STREET: Centre for Genome Research, The University of Edinburgh
(F) POSTAL CODE (ZIP): EH9 3JQ

(A) NAME: Ken A ROSE
(B) STREET: Centre for Genome Research, The University of Edinburgh
(C) CITY: Edinburgh
(E) COUNTRY: GB
(F) POSTAL CODE (ZIP): EH9 3JQ

(A) NAME: Jonathan Robert SECKL
(B) STREET: Molecular Medicine Centre, The University of Edinburgh
(C) CITY: Edinburgh
(E) COUNTRY: GB
(F) POSTAL CODE (ZIP): EH4 2XU

(A) NAME: Ruth BEST
(C) CITY: Edinburgh
(E) COUNTRY: GB
(F) POSTAL CODE (ZIP): EH4 2XU

(A) NAME: Joyce Lai wah YAU
(B) STREET: Molecular Medicine Centre, The University of Edinburgh
(F) POSTAL CODE (ZIP): EH4 2XU

(B) STREET: Molecular Medicine Centre, The University of Edinburgh
(C) CITY: Edinburgh
(E) COUNTRY: GB
(F) POSTAL CODE (ZIP): EH4 2XU

(ii) TITLE OF INVENTION: NEUROSTEROIDS

(iii) NUMBER OF SEQUENCES: 6

(iv) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(v) CURRENT APPLICATION DATA:
APPLICATION NUMBER: *****

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1763 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..1245

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

GCC TTG GAG TAC CAG TAT GTA ATG AAA AAC CCA AAA CAA TTA AGC TTT 48
Ala Leu Glu Tyr Gln Tyr Val Met Lys Asn Pro Lys Gln Leu Ser Phe
1 5 10 15

GAG AAG TTC AGC CGA AGA TTA TCA GCG AAA GCC TTC TCT GTC AAG AAG 96
Glu Lys Phe Ser Arg Arg Leu Ser Ala Lys Ala Phe Ser Val Lys Lys
20 25 30

CTG CTA ACT AAT GAC GAC CTT AGC AAT GAC ATT CAC AGA GGC TAT CTT 144
Leu Leu Thr Asn Asp Asp Leu Ser Asn Asp Ile His Arg Gly Tyr Leu
35 40 45

CTT TTA CAA GGC AAA TCT CTG GAT GGT CTT CTG GAA ACC ATG ATC CAA 192
Leu Leu Gln Gly Lys Ser Leu Asp Gly Leu Leu Glu Thr Met Ile Gln
50 55 60

GAA GTA AAA GAA ATA TTT GAG TCC AGA CTG CTA AAA CTC ACA GAT TGG 240
Glu Val Lys Glu Ile Phe Glu Ser Arg Leu Leu Lys Leu Thr Asp Trp
65 70 75 80

AAT ACA GCA AGA GTA TTT GAT TTC TGT AGT TCA CTG GTA TTT GAA ATC 288
Asn Thr Ala Arg Val Phe Asp Phe Cys Ser Ser Leu Val Phe Glu Ile
85 90 95

ACA TTT ACA ACT ATA TAT GGA AAA ATT CTT GCT GCT AAC AAA AAA CAA 336
Thr Phe Thr Thr Ile Tyr Gly Lys Ile Leu Ala Ala Asn Lys Lys Gln
100 105 110

ATT ATC AGT GAG CTG AGG GAT GAT TTT TTA AAA TTT GAT GAC CAT TTC
Ile Ile Ser Glu Leu Arg Asp Asp Phe Leu Lys Phe Asp Asp His Phe
115 120

CCA TAC TTA GTA TCT GAC ATA CCT ATT CAG CTT CTA AGA AAT GCA GAA
Pro Tyr Leu Val Ser Asp Ile Pro Ile Gln Leu Leu Arg Asn Ala Glu
130 135 140

TTT ATG CAG AAG AAA ATT ATA AAA TGT CTC ACA CCA GAA AAA GTA GCT
Phe Met Gln Lys Lys Ile Ile Lys Cys Leu Thr Pro Glu Lys Val Ala
145 150



CAG ATG CAA AGA CGG TCA GAA ATT GTT CAG GAG AGG <^ «« *™ 
Gin Met Gin Arg Arg Ser Glu He Val Gin Glu Arg Gin Glu Met Leu 



165 170 



195 200 

„ m « « T a, u « « » "J^^i^U 

Ala Met Tyr Tyr Leu Leu Gin His pro l,xu ^ 
210 215 

Z E S % S TLTLtttt 

230 2 3b 

s s s - K^^-^ -™ 

245 250 



384 



432 



480 



528 



576 



624 



672 



720 



768 



275 280 

AGC TAC CGT CTG CGG AAA GGA GAC TTT ™ ^^^^"^J* 
Ser Tyr Arg Leu Arg Lys Gly Asp Phe Val Ala Val Phe 

290 ^ 3 

_ _ « p Tr GAT GCT CCA AAG GAC TTT AGG TTT 

ATA CAC AAT GAC CCA GAA GTC TTC GOT ^ 

lie His Asn Asp Pro Glu Val Phe Asp Ala Pro Lys Asp g ^ 

310 



816 



864 



912 



960 



1008 
325 330 335

GGA AAA AAG CTG AAG AGT TAC ATT ATA CCA TTT GGA CTT GGA ACA AGC 1056
Gly Lys Lys Leu Lys Ser Tyr Ile Ile Pro Phe Gly Leu Gly Thr Ser
340 345 350

AAA TGT CCA GGC AGA TAC TTT GCA ATT AAT GAA ATG AAG CTA CTA GTG 1104
Lys Cys Pro Gly Arg Tyr Phe Ala Ile Asn Glu Met Lys Leu Leu Val
355 360 365

ATT ATA CTT TTA ACT TAT TTT GAT TTA GAA GTC ATT GAC ACT AAG CCT 1152
Ile Ile Leu Leu Thr Tyr Phe Asp Leu Glu Val Ile Asp Thr Lys Pro
370 375 380

ATA GGA CTA AAC CAC AGT CGC ATG TTT CTG GGC ATT CAG CAT CCA GAC 1200
Ile Gly Leu Asn His Ser Arg Met Phe Leu Gly Ile Gln His Pro Asp
385 390 395 400

TCT GAC ATC TCA TTT AGG TAC AAG GCA AAA TCT TGG AGA TCC TGA 1245
Ser Asp Ile Ser Phe Arg Tyr Lys Ala Lys Ser Trp Arg Ser *
405 410 415

AAGGGTGGCA GAGAAGCTTA GCGGAATAAG GCTGCACATG CTGAGCTCTG TGATTTGCTG 1305

TACTCCCCAA ATGCAGCCAC TATTCTTGTT TGTTAGAAAA TGGCAAATTT TTATTTGATT 1365

GCGATCCATC CAGTTTGTTT TGGGTCACAA AACCTGTCAT AAAATAAAGC GCTGTCATGG 1425

TGTAAAAAAA TGTCATGGCA ATCATTTCAG GATAAGGTAA AATAACGTTT TCAAGTTTGT 1485

ACTTACTATG ATTTTTATCA TTTGTAGTGA ATGTGCTTTT CCAGTAATAA ATTTGCGCCA 1545

GGGTGATTTT TTTTAATTAC TGAAATCCTC TAATATCGGT TTTATGTGCT GCCAGAAAAG 1605

TGTGCCATCA ATGGACAGTA TAACAATTTC CAGTTTTCCA GAGAAGGGAG AAATTAAGCC 1665

CCATGAGTTA CGCTGTATAA AATTGTTCTC TTCAACTATA ATATCAATAA TGTCTATATC 1725

ACCAGGTTAC CTTTGCATTA AATCGAGTTT TGCAAAAG 1763

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1880 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 81..1604

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
GGCAGGCACA GCCTCTGGTC TAAGAAGAGA GGGCACTGTG CAGAAGCCAT CGCTCCCTAC 60

AGAGCCGCCA GCTCGTCGGG ATG CAG GGA GCC ACG ACC CTA GAT GCC GCC
Met Gln Gly Ala Thr Thr Leu Asp Ala Ala
420 425

TCG CCA GGG CCT CTC GCC CTC CTA GGC CTT CTC TTT GCC GCC ACC TTA
Ser Pro Gly Pro Leu Ala Leu Leu Gly Leu Leu Phe Ala Ala Thr Leu
430 435 440

CTG CTC TCG GCC CTG TTC CTC CTC ACC CGG CGC ACC AGG CGC CCT CGT
Leu Leu Ser Ala Leu Phe Leu Leu Thr Arg Arg Thr Arg Arg Pro Arg
445 450 455

GAA CCA CCC TTG ATA AAA GGT TGG CTT CCT TAT CTT GGC ATG GCC CTG
Glu Pro Pro Leu Ile Lys Gly Trp Leu Pro Tyr Leu Gly Met Ala Leu
460 465 470

AAA TTC TTT AAG GAT CCG TTA ACT TTC TTG AAA ACT CTT CAA AGG CAA
Lys Phe Phe Lys Asp Pro Leu Thr Phe Leu Lys Thr Leu Gln Arg Gln
475 480 485

CAT GGT GAC ACT TTC ACT GTC TTC CTT GTG GGG AAG TAT ATA ACA TTT
His Gly Asp Thr Phe Thr Val Phe Leu Val Gly Lys Tyr Ile Thr Phe
490 495 500

GTT CTG AAC CCT TTC CAG TAC CAG TAT GTA ACG AAA AAC CCA AAA CAA
Val Leu Asn Pro Phe Gln Tyr Gln Tyr Val Thr Lys Asn Pro Lys Gln
510 515 520

TTA AGC TTT CAG AAG TTC AGC AGC CGA TTA TCA GCG AAA GCC TTC TCT
Leu Ser Phe Gln Lys Phe Ser Ser Arg Leu Ser Ala Lys Ala Phe Ser
525 530 535

GTA AAG AAG CTG CTT ACT GAT GAC GAC CTT AAT GAA GAC GTT CAC AGA
Val Lys Lys Leu Leu Thr Asp Asp Asp Leu Asn Glu Asp Val His Arg
540 545 550

30 GCC TAT CTA CTT CTA CAA GGC AAA CCT TTG GAT GCT ' CTT ■ ™ ™ ^ 
Ala Tyr Leu Leu Leu Gin Gly Lys Pro Leu Asp Ala Leu Leu Glu Thr 
555 560 565 

ATG ATC CAA GAA GTA AAA GAA TTA TTT GAG TCC CAA CTG CTA AAA ATC 
Z, 111 G?n Glu Val Lys Glu Leu Phe Glu Ser Gin Leu Leu Lys lie 
35 570 575 580 

ACA GAT TGG AAC ACA GAA AGA ATA ^^J^^er Teu^al 
Thr Asp Trp Asn Thr Glu Arg He Phe Ala Phe cys u y 
590 595 

' TTT GAG ATC ACA TTT GCG ACT CTA TAT GGA ^^^u^^Ln 
40 Phe Glu lie Thr Phe Ala Thr Leu Tyr Gly Lys He Leu Ala Gly Asn 
SOS 610 

A^r rn-r HAT TTT TTT AAA TTT GAT 
AAG AAA CAA ATT ATC AGT GAG CTA AGG GAT GAT TH 



110 



158 



206 



254 



302 



350 



398 



446 



494 



542 



590 



63 8 



686 



734 
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Lys Lys Gin He He Ser Glu Leu Arg Asp Asp Phe Phe Lys Phe Asp 
620 625 630 

GAC ATG TTC CCA TAC TTA GTA TCT GAC ATA CCT ATT CAG CTT CTA AGA 782 
Asp Met Phe Pro Tyr Leu Val Ser Asp Ile Pro Ile Gln Leu Leu Arg 
635 640 645 

AAT GAA GAA TCT ATG CAG AAG AAA ATT ATA AAA TGC CTC ACA TCA GAA 830 
Asn Glu Glu Ser Met Gln Lys Lys Ile Ile Lys Cys Leu Thr Ser Glu 
650 655 660 665 

AAA GTA GCT CAG ATG CAA GGA CAG TCA AAA ATT GTT CAG GAA AGC CAA 878 
Lys Val Ala Gln Met Gln Gly Gln Ser Lys Ile Val Gln Glu Ser Gln 
670 675 680 

GAT CTG CTG AAA AGA TAC TAT AGG CAT GAC GAT TCT GAA ATA GGA GCA 926 
Asp Leu Leu Lys Arg Tyr Tyr Arg His Asp Asp Ser Glu Ile Gly Ala 
685 690 695 

CAT CAT CTT GGC TTT CTC TGG GCC TCT CTA GCA AAC ACC ATT CCA GCT 974 
His His Leu Gly Phe Leu Trp Ala Ser Leu Ala Asn Thr Ile Pro Ala 
700 705 710 

ATG TTC TGG GCA ATG TAT TAT ATT CTT CGG CAT CCT GAA GCT ATG GAA 1022 
Met Phe Trp Ala Met Tyr Tyr Ile Leu Arg His Pro Glu Ala Met Glu 
715 720 725 

GCC CTG CGT GAC GAA ATT GAC AGT TTC CTG CAG TCA ACA GGT CAA AAG 1070 
Ala Leu Arg Asp Glu Ile Asp Ser Phe Leu Gln Ser Thr Gly Gln Lys 
730 735 740 745 

AAA GGG CCT GGA ATT TCA GTC CAC TTC ACC AGA GAA CAA TTG GAC AGC 1118 
Lys Gly Pro Gly Ile Ser Val His Phe Thr Arg Glu Gln Leu Asp Ser 
750 755 760 

TTG GTC TGC CTG GAA AGC ACT ATT CTT GAG GTT CTG AGG CTG TGC TCA 1166 
Leu Val Cys Leu Glu Ser Thr Ile Leu Glu Val Leu Arg Leu Cys Ser 
765 770 775 

TAC TCC AGC ATC ATC CGA GAA GTG CAG GAG GAT ATG AAT CTC AGC TTA 1214 
Tyr Ser Ser Ile Ile Arg Glu Val Gln Glu Asp Met Asn Leu Ser Leu 
780 785 790 

GAG AGT AAG AGT TTC TCT CTG CGG AAA GGA GAT TTT GTA GCC CTC TTT 1262 
Glu Ser Lys Ser Phe Ser Leu Arg Lys Gly Asp Phe Val Ala Leu Phe 
795 800 805 

CCT CCA CTC ATA CAC AAT GAC CCG GAA ATC TTC GAT GCT CCA AAG GAA 1310 
Pro Pro Leu Ile His Asn Asp Pro Glu Ile Phe Asp Ala Pro Lys Glu 
810 815 820 825 

TTT AGG TTC GAT CGG TTC ATA GAA GAT GGT AAG AAG AAA AGC ACG TTT 1358 
Phe Arg Phe Asp Arg Phe Ile Glu Asp Gly Lys Lys Lys Ser Thr Phe 
830 835 840 

TTC AAA GGA GGG AAG AGG CTG AAG ACT TAC GTT ATG CCT TTT GGA CTC 1406 
Phe Lys Gly Gly Lys Arg Leu Lys Thr Tyr Val Met Pro Phe Gly Leu 
845 850 855 

GGA ACA AGC AAA TGT CCA GGG AGA TAT TTT GCA GTG AAC GAA ATG AAG 1454 
Gly Thr Ser Lys Cys Pro Gly Arg Tyr Phe Ala Val Asn Glu Met Lys 
860 865 870 



CTA CTG CTG ATT GAG CTT TTA ACT TAT TTT GAT TTA GAA ATT ATC GAC 1502 
Leu Leu Leu Ile Glu Leu Leu Thr Tyr Phe Asp Leu Glu Ile Ile Asp 
875 880 885 

AGG AAG CCT ATA GGG CTA AAT CAC AGT CGG ATG TTT TTA GGT ATT CAG 1550 
Arg Lys Pro Ile Gly Leu Asn His Ser Arg Met Phe Leu Gly Ile Gln 
890 895 900 905 

CAC CCC GAT TCT GCC GTC TCC TTT AGG TAC AAA GCA AAA TCT TGG AGA 1598 
His Pro Asp Ser Ala Val Ser Phe Arg Tyr Lys Ala Lys Ser Trp Arg 
910 915 920 

AGC TGA AAGTGTGGCA GAGAAGCTTT GCAGAGTAAG GCTGCATGTG CTGAGCTCCG 1654 
Ser * 

TGATTTGGTG CACTCCCCCA AATGCAACCG CTACTCTTGT TTGAAAATGG CAAATTTATA 1714 

TTTGGTTGAG ATCAATCCAG TTGGTTTTGG GTCACAAAAC CTGTCATAAA ATAAAGCAGT 1774 

GTGATGGTTT AAAAAATGTC ATGGCAATCA TTTCAGGATA AGGTAAAATA ACATTTTCAA 1834 

GTTTGTACTT ACTATGATTT TTATCATTTG TAGTGAATGT GCTTTT 1880 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 831..2078 

(ix) FEATURE: 

(A) NAME/KEY: exon (3) 

(B) LOCATION: 831..1422 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 1423..1872 

(ix) FEATURE: 

(A) NAME/KEY: exon (4) 

(B) LOCATION: 1873..2078 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GGATCCAACC AAGTTTCCAG ATCTTATAAA TGTGGTGAAT GGTGAATGAC TTCCTGAAGA 60 

ATGGATGAAT GGATGTGTTC TAGTTTGGAA TCCTGTGTCA GTCACAAGTC AATATGTGAC 120 

CTTGAACATG TTATTAAATC TCCCACATCC ATAAAAGTGA AAATGCTGGC ATTAGTGGAT 180 

TTTTGCCAGT GTTGAATTAG ACATTTATTT GTGAGTACCT GCTCCATACA GTATGGTCAT 240 

TTATTTGAGT TAAAATTGTT GTATTTGAAC AAAACTCAGA TGACACCTAA GCATGAAAAA 300 

GCTCTTTATG AAGTATAAAT ACTCAGAAAT GGAATGGCAT GTTGCCAATT TGTTTTCTGC 360 

TTTATTGAGG GAAATATATG AGAAGTATTT AAGTCAGGGG ATTATGAGGA ATATTTAAAG 420 

GATANNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 480 

NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 540 

NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 600 

NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNTCTAGA GTGTTTTCCA CCATCTTTCA 660 

AAGGAAACAT GTAGTGTACC TTCGAATGAA ATGGATTTGT ATTAAACTTT TTGCCTTAGT 720 

TATTAGGGTC TTTCTAATTT TTGATTAACA TATTTTTTTA ATTTGTGGTG TTTATTTCTG 780 

TTTTTATTAA CAAACGAACT CATATGCTCC TCTCTCTTTT TTTTTTTTCT GGAAAGTACA 840 

TAACATTTAT ACCTGGACCC TTCCAGTACC AGCTAGTGAT AAAAAATCAT AAACAATTAA 900 

GCTTTCGAGT ATCTTCTAAT AAATTATCAG AGAAAGCATT TAGCATCAGT CAGTTGCAAA 960 

AAAATCATGA CATGAATGAT GAGCTTCACC TCTGCTATCA ATTTTTGCAA GGCAAATCTT 1020 

TGGACATACT CTTGGAAAGC ATGATGCAGA ATCTAAAACA AGTTTTTGAA CCCCAGCTGT 1080 

TAAAAACCAC AAGTTGGGAC ACGGCAGAAC TGTATCCATT CTGCAGCTCA ATAATATTTG 1140 

AGATCACATT TACAACTATA TATGGAAAAG TTATTGTTTG TGACAACAAC AAATTTATTA 1200 

GTGAGCTAAG AGATGATTTT TTAAAATTTG ATGACAAGTT TGCATATTTA GTATCCAACA 1260 

TACCCATTGA GCTTCTAGGA AATGTCAAGT CTATTAGAGA GAAAATTATA AAATGCTTCT 1320 

CATCAGAAAA GTTAGCCAAG ATGCAAGGAT GGTCAGAAGT TTTTCAAAGC AGGCAAGATG 1380 

ACCTGGAGAA ATATTATGTG CACGAGGACC TTGAAATAGG AGGTAAGAAC TTCTGAATGA 1440 

GCACTTGCCT AAATAAAAAT CATTTACATA GACCTCTGAA ATAAAAAAAG ACAAAATGGC 1500 
GACCTTGAAA ATTTTTTTAT GCTCTTTCTA ATTGGCTAAT GATAAATGTT TACTCTGATA 1560 
TAACCTCTAT AATTGATATT TTTTTTTTTG CTGAGGTGGT AAACAGATAC TTAATGGTGA 1620 
TAATGAGAAA GCGTATAACT AAGCTGCATT TATCCCTCTT ATCTCATCCC CGACCACACC 1680 
GCCCCCCCCA TACACATTAC ATTTTAAACT ATTCTCATTA AGCAGAAAAT TAGACTTCAG 1740 
AAGCCTATTG GTTCTCATTA GCATGCAGTG ATCCTTGGCT GGTCTGTGTC CTAACATCTT 1800 
TTAATTAGCA CACTGCAAAT CTAATCAGTG TAATAAACGC TATTAATCTT CCTTTACACT 1860 
TATTTTCTCC CACACATCAT TTAGGCTTTC TCTGGGCCTC TGTGGCAAAC ACTATTCCAA 1920 
CTATGTTCTG GGCAACGTAT TATCTTCTGC GGCACCCAGA AGCTATGGCA GCAGTGCGTG 1980 
ACGAAATTGA CCGTTTGCTG CAGTCAACAG GTCAAAAGGA AGGGTCTGGA TTTCCCATCC 2040 
ACCTCACCAG AGAACAATTG GACAGCCTAA TCTGCCTAGG TAATTATTTT ATCTGTTATG 2100 
AAGAAAGAAG GTACCTCTCT GCAAACTCGG TTTATCACTC ATAGCTGTTT ACAAGAGGTA 2160 
GAGGACACAG CTGCTAATTG ACATAATAAC TCCCATTTAC ATCAATTATA AATTATGTAG 2220 
TTTATAGCCG TAGATCATCT CATTGCATGT AAACATAAGG CCTATGTAAT TAACTGTGTA 2280 
ATGTATGTAA AATTCTAACC AAAGCTTNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2340 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2400 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2460 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2520 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2580 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2640 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2700 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2760 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2820 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNCCT GACTGAACTT CTTACTGCCA 2880 
AAGTTAAATT CCATACCAAT GAGTTATTCT [illegible] TGTATTGACA TTTCATCTGC 2940 
GGTATCCTTT AGGGTACAAT ATTCCAAGTT TCTTTAGACA AACGCAGGAA CAAATGTTCA 3000 
CATATTTCTG TTTCTTTATT CCTTTGACAA GTAGGCGAGC ATTTTAGCCT ATGTTGGTCT 3060 
CAAAAAAAAT CTTTTAAATA TGTTCCAGGT TCTTTAATGG GACCTTTCAG GAGCAAAAGT 3120 
[illegible] 3180 
CCTCCCAGGT TTGGTCAATG TTCACCCTCN GTGGCCATTG AGGAAAATGC CCNNNNNGTT 3240 

CTAGAGATTG TTCTCACTTC TCAGGCTAAG GCCCATTGAG CAATGCCAGA AAGCATGCCT 3300 

TATACTAGCA GTCAATTTGG AAGTTTGTAG TTTGTGTCTT TAGCATAGGT TATCAAATAA 3360 

ATTTTATATT TNCTTTTAAA AAAATCTCAA CATTACTAAA ATACAAATAT CCTTTTATTT 3420 

TTCTTTGCAG AATTATCGGG GAACAAATCC AGAAAATTTG TGTAAATTTC GGGTAGTTGC 3480 

TCCACTTGAT ACACAGTATT TCTGCATATT GTAATTTCTA TGAAGATCTA GGTTGCATTT 3540 

CCCATACATT CAAGCAGTTT CCATTGCATT TTTATGAATA AGATGACGCA TACTGGGAAG 3600 

TAAGGCAAAT ACACTAAAAG GAATATGTGT TTGTATTCTG TATAGTTATT ACTCTTAAAA 3660 

AAAGTAGTTG TAATTCATCC ACTCTTTTTA CTTTCAACTT TTTGCTATTA AAAAATCATT 3720 

TTTAAATTTC AGTATTAAAG CAGAAACATT TAAATTTATT AGACCAGAAA AATAACAGAT 3780 

TCTAGAACTA TAATTTGAAT CCATTTAAGC CCATAGCTAG AGCTAGAGAT TTTCACTATT 3840 

GGATCC 3846 
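
The FEATURE locations given for SEQ ID NO: 3 can be cross-checked arithmetically. The following is a minimal Python sketch, assuming the locations are 1-based and inclusive and reading the intron as 1423..1872 and exon (4) as 1873..2078. The names FEATURES, span_length and splice are hypothetical helpers introduced only for this illustration. Under these assumptions the two exons total 798 nucleotides, i.e. 266 codons, which agrees with the length of 266 amino acids stated for SEQ ID NO: 6.

```python
# Feature locations as listed for SEQ ID NO: 3, read as 1-based inclusive
# (start, end) pairs. The dictionary and helper names are illustrative only.
FEATURES = {
    "CDS": (831, 2078),
    "exon 3": (831, 1422),
    "intron": (1423, 1872),
    "exon 4": (1873, 2078),
}


def span_length(start, end):
    """Number of nucleotides in a 1-based inclusive span."""
    return end - start + 1


def splice(sequence, spans):
    """Join the given 1-based inclusive spans of a sequence string."""
    return "".join(sequence[start - 1:end] for start, end in spans)


# Total exon length and the number of whole codons it contains.
exon_total = span_length(*FEATURES["exon 3"]) + span_length(*FEATURES["exon 4"])
codons = exon_total // 3

if __name__ == "__main__":
    for name, (start, end) in FEATURES.items():
        print(name, start, end, span_length(start, end))
    print("exon total:", exon_total, "nt =", codons, "codons")
```

Given the full 3846-nucleotide string, splice(sequence, [FEATURES["exon 3"], FEATURES["exon 4"]]) would return the joined exon sequence under the same coordinate assumption.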



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 415 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Ala Leu Glu Tyr Gln Tyr Val Met Lys Asn Pro Lys Gln Leu Ser Phe 
1 5 10 15 

Glu Lys Phe Ser Arg Arg Leu Ser Ala Lys Ala Phe Ser Val Lys Lys 
20 25 30 

Leu Leu Thr Asn Asp Asp Leu Ser Asn Asp Ile His Arg Gly Tyr Leu 
35 40 45 

Leu Leu Gln Gly Lys Ser Leu Asp Gly Leu Leu Glu Thr Met Ile Gln 
50 55 60 

Glu Val Lys Glu Ile Phe Glu Ser Arg Leu Leu Lys Leu Thr Asp Trp 
65 70 75 80 

Asn Thr Ala Arg Val Phe Asp Phe Cys Ser Ser Leu Val Phe Glu Ile 
85 90 95 

Thr Phe Thr Thr Ile Tyr Gly Lys Ile Leu Ala Ala Asn Lys Lys Gln 
100 105 110 

Ile Ile Ser Glu Leu Arg Asp Asp Phe Leu Lys Phe Asp Asp His Phe 
115 120 125 

Pro Tyr Leu Val Ser Asp Ile Pro Ile Gln Leu Leu Arg Asn Ala Glu 
130 135 140 

Phe Met Gln Lys Lys Ile Ile Lys Cys Leu Thr Pro Glu Lys Val Ala 
145 150 155 160 

Gln Met Gln Arg Arg Ser Glu Ile Val Gln Glu Arg Gln Glu Met Leu 
165 170 175 

Lys Lys Tyr Tyr Gly His Glu Glu Phe Glu Ile Gly Ala His His Leu 
180 185 190 

Gly Leu Leu Trp Ala Ser Leu Ala Asn Thr Ile Pro Ala Met Phe Trp 
195 200 205 

Ala Met Tyr Tyr Leu Leu Gln His Pro Glu Ala Met Glu Val Leu Arg 
210 215 220 

Asp Glu Ile Asp Ser Phe Leu Gln Ser Thr Gly Gln Lys Lys Gly Pro 
225 230 235 240 

Gly Ile Ser Val His Phe Thr Arg Glu Gln Leu Asp Ser Leu Val Cys 
245 250 255 

Leu Glu Ser Ala Ile Leu Glu Val Leu Arg Leu Cys Ser Tyr Ser Ser 
260 265 270 

Ile Ile Arg Glu Val Gln Glu Asp Met Asp Phe Ser Ser Glu Ser Arg 
275 280 285 

Ser Tyr Arg Leu Arg Lys Gly Asp Phe Val Ala Val Phe Pro Pro Met 
290 295 300 

Ile His Asn Asp Pro Glu Val Phe Asp Ala Pro Lys Asp Phe Arg Phe 
305 310 315 320 

Asp Arg Phe Val Glu Asp Gly Lys Lys Lys Thr Thr Phe Phe Lys Gly 
325 330 335 

Gly Lys Lys Leu Lys Ser Tyr Ile Ile Pro Phe Gly Leu Gly Thr Ser 
340 345 350 

Lys Cys Pro Gly Arg Tyr Phe Ala Ile Asn Glu Met Lys Leu Leu Val 
355 360 365 

Ile Ile Leu Leu Thr Tyr Phe Asp Leu Glu Val Ile Asp Thr Lys Pro 
370 375 380 

Ile Gly Leu Asn His Ser Arg Met Phe Leu Gly Ile Gln His Pro Asp 
385 390 395 400 

Ser Asp Ile Ser Phe Arg Tyr Lys Ala Lys Ser Trp Arg Ser * 
405 410 415 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Gln Gly Ala Thr Thr Leu Asp Ala Ala Ser Pro Gly Pro Leu Ala 
1 5 10 15 

Leu Leu Gly Leu Leu Phe Ala Ala Thr Leu Leu Leu Ser Ala Leu Phe 
20 25 30 

Leu Leu Thr Arg Arg Thr Arg Arg Pro Arg Glu Pro Pro Leu Ile Lys 
35 40 45 

Gly Trp Leu Pro Tyr Leu Gly Met Ala Leu Lys Phe Phe Lys Asp Pro 
50 55 60 

Leu Thr Phe Leu Lys Thr Leu Gln Arg Gln His Gly Asp Thr Phe Thr 
65 70 75 80 

Val Phe Leu Val Gly Lys Tyr Ile Thr Phe Val Leu Asn Pro Phe Gln 
85 90 95 

Tyr Gln Tyr Val Thr Lys Asn Pro Lys Gln Leu Ser Phe Gln Lys Phe 
100 105 110 

Ser Ser Arg Leu Ser Ala Lys Ala Phe Ser Val Lys Lys Leu Leu Thr 
115 120 125 

Asp Asp Asp Leu Asn Glu Asp Val His Arg Ala Tyr Leu Leu Leu Gln 
130 135 140 

Gly Lys Pro Leu Asp Ala Leu Leu Glu Thr Met Ile Gln Glu Val Lys 
145 150 155 160 

Glu Leu Phe Glu Ser Gln Leu Leu Lys Ile Thr Asp Trp Asn Thr Glu 
165 170 175 

Arg Ile Phe Ala Phe Cys Gly Ser Leu Val Phe Glu Ile Thr Phe Ala 
180 185 190 

Thr Leu Tyr Gly Lys Ile Leu Ala Gly Asn Lys Lys Gln Ile Ile Ser 
195 200 205 

Glu Leu Arg Asp Asp Phe Phe Lys Phe Asp Asp Met Phe Pro Tyr Leu 
210 215 220 

Val Ser Asp Ile Pro Ile Gln Leu Leu Arg Asn Glu Glu Ser Met Gln 
225 230 235 240 

Lys Lys Ile Ile Lys Cys Leu Thr Ser Glu Lys Val Ala Gln Met Gln 
245 250 255 

Gly Gln Ser Lys Ile Val Gln Glu Ser Gln Asp Leu Leu Lys Arg Tyr 
260 265 270 

Tyr Arg His Asp Asp Ser Glu Ile Gly Ala His His Leu Gly Phe Leu 
275 280 285 

Trp Ala Ser Leu Ala Asn Thr Ile Pro Ala Met Phe Trp Ala Met Tyr 
290 295 300 

Tyr Ile Leu Arg His Pro Glu Ala Met Glu Ala Leu Arg Asp Glu Ile 
305 310 315 320 

Asp Ser Phe Leu Gln Ser Thr Gly Gln Lys Lys Gly Pro Gly Ile Ser 
325 330 335 

Val His Phe Thr Arg Glu Gln Leu Asp Ser Leu Val Cys Leu Glu Ser 
340 345 350 

Thr Ile Leu Glu Val Leu Arg Leu Cys Ser Tyr Ser Ser Ile Ile Arg 
355 360 365 

Glu Val Gln Glu Asp Met Asn Leu Ser Leu Glu Ser Lys Ser Phe Ser 
370 375 380 

Leu Arg Lys Gly Asp Phe Val Ala Leu Phe Pro Pro Leu Ile His Asn 
385 390 395 400 

Asp Pro Glu Ile Phe Asp Ala Pro Lys Glu Phe Arg Phe Asp Arg Phe 
405 410 415 

Ile Glu Asp Gly Lys Lys Lys Ser Thr Phe Phe Lys Gly Gly Lys Arg 
420 425 430 

Leu Lys Thr Tyr Val Met Pro Phe Gly Leu Gly Thr Ser Lys Cys Pro 
435 440 445 

Gly Arg Tyr Phe Ala Val Asn Glu Met Lys Leu Leu Leu Ile Glu Leu 
450 455 460 

Leu Thr Tyr Phe Asp Leu Glu Ile Ile Asp Arg Lys Pro Ile Gly Leu 
465 470 475 480 

Asn His Ser Arg Met Phe Leu Gly Ile Gln His Pro Asp Ser Ala Val 
485 490 495 

Ser Phe Arg Tyr Lys Ala Lys Ser Trp Arg Ser 
500 505 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Gly Lys Tyr Ile Thr Phe Ile Pro Gly Pro Phe Gln Tyr Gln Leu Val 
1 5 10 15 

Ile Lys Asn His Lys Asn Leu Ser Phe Arg Val Ser Ser Asn Lys Leu 
20 25 30 

Ser Glu Lys Ala Phe Ser Ile Ser Gln Leu Gln Lys Asn His Asp Met 
35 40 45 

Asn Asp Glu Leu His Leu Cys Tyr Gln Phe Leu Gln Gly Lys Ser Leu 
50 55 60 

Asp Ile Leu Leu Glu Ser Met Met Gln Asn Leu Lys Gln Val Phe Glu 
65 70 75 80 

Pro Gln Leu Leu Lys Thr Thr Ser Trp Asp Thr Ala Glu Leu Tyr Pro 
85 90 95 

Phe Cys Ser Ser Ile Ile Phe Glu Ile Thr Phe Thr Thr Ile Tyr Gly 
100 105 110 

Lys Val Ile Val Cys Asp Asn Asn Lys Phe Ile Ser Glu Leu Arg Asp 
115 120 125 

Asp Phe Leu Lys Phe Asp Asp Lys Phe Ala Tyr Leu Val Ser Asn Ile 
130 135 140 

Pro Ile Glu Leu Leu Gly Asn Val Lys Ser Ile Arg Glu Lys Ile Ile 
145 150 155 160 

Lys Cys Phe Ser Ser Glu Lys Leu Ala Lys Met Gln Gly Trp Ser Glu 
165 170 175 

Val Phe Gln Ser Arg Gln Asp Asp Leu Glu Lys Tyr Tyr Val His Glu 
180 185 190 

Asp Leu Glu Ile Gly Ala His His Phe Gly Phe Leu Trp Val Ser Val 
195 200 205 

Ala Ser Thr Ile Pro Thr Met Phe Trp Ala Thr Tyr Tyr Leu Leu Arg 
210 215 220 
His Pro Glu Ala Met Ala Ala Val Arg Asp Glu Ile Asp Arg Leu Leu 
225 230 235 240 

Gln Ser Thr Gly Gln Lys Glu Gly Ser Gly Phe Pro Ile His Leu Thr 
245 250 255 

Arg Glu Gln Leu Asp Ser Leu Ile Cys Leu 
260 265 
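
As an illustration of how the peptide of SEQ ID NO: 6 relates to the genomic sequence of SEQ ID NO: 3, the short Python sketch below translates nucleotides 831 to 878 of SEQ ID NO: 3 (the first 48 nucleotides of exon 3, as transcribed from the listing, with the spacing inside the block "AG CTAGTG AT" read as an extraction gap). It assumes the standard genetic code and carries only the twelve codons needed here. CODON_TABLE, EXON3_START and translate are hypothetical names introduced for this example. Under these assumptions the output reproduces the first sixteen residues of SEQ ID NO: 6.

```python
# Partial standard genetic code: only the codons needed for this example.
CODON_TABLE = {
    "GGA": "Gly", "AAG": "Lys", "TAC": "Tyr", "ATA": "Ile",
    "ACA": "Thr", "TTT": "Phe", "CCT": "Pro", "CCC": "Pro",
    "TTC": "Phe", "CAG": "Gln", "CTA": "Leu", "GTG": "Val",
}

# Nucleotides 831-878 of SEQ ID NO: 3, written in the listing's 10-base blocks.
EXON3_START = "".join([
    "GGAAAGTACA",  # 831-840
    "TAACATTTAT",  # 841-850
    "ACCTGGACCC",  # 851-860
    "TTCCAGTACC",  # 861-870
    "AGCTAGTG",    # 871-878
])


def translate(dna, table=CODON_TABLE):
    """Translate whole codons from position 0; unknown codons become 'Xaa'."""
    residues = []
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        residues.append(table.get(dna[i:i + 3], "Xaa"))
    return residues


if __name__ == "__main__":
    print(" ".join(translate(EXON3_START)))
    # Gly Lys Tyr Ile Thr Phe Ile Pro Gly Pro Phe Gln Tyr Gln Leu Val
```

A complete translation would need the full 64-codon table; codons absent from this partial table are reported as Xaa rather than guessed.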